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Abstract 



There has been increasing attention to the fine structure of abilities un- 
derlying task performance (Haertel and Wiley (in press)). One of the useful 
approaches for this is by use of graphical models to represent relationships 
among abilities and test items. Building a large graphical model is always 
an issue. Restrictions upon experiments and data collection, among others, 
may result in parts of the large model. Or it may be convenient for us to 
build parts of the large models first, and then try to combine those parts into 
a larger model. This paper derives, confined to categorical variables only, a 
theory which may be useful in combining conditional graphical models into a 
larger one. The main result of the paper is that we can see partial information 
about a true log-linear structure (LLS) from its conditional LLSs and use tlie 
information in trying lo guess the true LLS, assuming that the true LLS is 
graphical. An application of the result is illustrated using a simulated data 
set. 

Key words: log-linear model, strong hierarchy assumption, liyporniodcl. 
traccatjility, influence diagravn. 
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1 Introduction and Problem 



During the last two decades, psychological research has beeii focused on 
tasks that better approximate the meaningful learning and problem-solving 
activities that engage people in real life. There has been increasing attention 
to the fine structure of abilities underlying task performance (Haertel and 
Wiley (in press)). As Haertel and Wiley (in press) note, current test theory 
may not be sufficient to characterize the fine structure of the ability pattern 
and hence to build upon this structure to more clearly represent the acquisition 
and the structure of aggregate abilities. One of the appropriate approaches in 
dealing with this kind of issues is by use of graphical models by which wc can 
represent relationships among .ihilities and test items. 

In this paper, "graphical models" includes graphical log-linear models, in- 
fluence diagrams and Bayes Networks. One of the problems that wc face when 
wc try to build graphical models of task abilities and test items is described 
below. Consider the test items which can be solved in several different ways. 
Problem-solving strategies may vary across a group of individuals. For exam- 
ple, in dealing with mixtures of whole numbers and fractions, a student may 
prefer dealing with those numbers in a fraction form, while another in a mixed 
form (Tatsuoka (1990)). Skills used in solving a problem may vary accord- 



ing to the strategies selected by test-takers. Some skills used in one strategy 
may not be used in another. This is one of the reasons why it is desirable 
to build models in two steps when there are more than one strategy available 
— first, we build graphical models for each problem-solving strategy; second, 
we combine them into a graphical model where a categorical variable tor the 
problem-solving strategies is included as a new variable. 

The main goal of this paper is to derive a rule which is useful when com- 
bining graphical models, where the same set of variables are involved in each 
of the graphical models. The above-mentioned three types of graphical models 
share a common ground (see Lauritzen and Spiegelhalter (1988), Pearl (1988). 
Smith (1989)). That is, Bayes networks and influence diagrams are inter- 
pretable in terms of graphical log-linear model under a posit ivity condition 
that every configuration of the variables involved in the model has i)Ositiv(> 
probability. Hence in our derivation of the combining rule our discussion will 
be in the context of graphical log-linear model. 

We will define by log-linear stnuanr (LLS) the generating class of a lof;- 
liriear model. If the log-linear model is graphical, then its LLS can be rei)re- 
sented in the form of a graph. Suppose t.hat a categorical variable X has / 
levels, and l,hat we have / conditional LLSs for a set of variables, corresiroiidiiig 
to the / possible outcomes of X In these circumstances it s(vnis desirable to 

li 
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Figure 1.1: Hybrid tree/log-linear structure representation 




LLS\ LLS2 LLSi-i LLSi 

develop hybrid tree/log-linear structure representations. Such a hybrid rep- 
resentation is given i i Figure 1.1. We will sinnply call such a representation 
a hybrid. A hybrid can be described in ternns of log-linear structures only, 
instead of a graph. For example, Figure 1.1 can be expressed by 

{LLSuLLS2,--- ,LLS,), 

where the subscripts indicate the values of X. 

There are only finitely many possible log-linear structures corresponding 
to a hybrid, if the hybrid involves only a finite number of random variables. 
We will call the log-linear structure of a log-linear model which gives rise to 
the hybrid h, say, by a hypermodel corresponding to (abbreviated to "c.t.") h. 
Tliere may be many hypermodels corresponding to a hybrid. 

This paper consists of 7 sections. In sections 2 through 5, we derive a rule 
for obtaining hypermodels from a given hybrid. Section 2 deals in detail witli 
a conditional log-linear model. The possible conditional log-linear structures 

4 

9 



of a given log-linear structure are discussed in section ?>. In section 1. we 
derive basic rules of obtaining the set of the possible graphical hyperniodrls 
corresponding to a given hybrid. Section 5 shows that we can find the true 
graphical hypermodel with some uncertainty by applying the rules derived i.i 
section 4. Section 6 illustrates combining two graphical models by applying 
the main results of this paper. Section 7 concludes the paper. 

In the rest of this paper, we will use abbreviations (in the parenthesis) ,'is 
follows if confusion is not likely; log-linear model (LLM), log-linear stiurture 
(LLS). conditional log-linear model (CLLM), conditional log-linear stnuturc 
(C'LLS). 
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2 A General Expression for the CLLM and 
the Strong Hierarchy Assumption 

Suppose there are n categorical variables A'l , A'2, ■ ■ ■ . A'„ , for which we will 
consider LLMs under the hierarchy assum])tion. We will l)orrow most ol the 
notations from Bishop, Fienberg, and Holland (1980). In this paper, we will 
ronsider conditional probabilities of a set of variables giv(Mi an outroiiic ol A, 
variable, and assume that A'l takes on /] values ,rn '"i /, • 1'*'' 0\.(>i~ ■ • ■ - (U 
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be distinct subsets of {1, 2, • • • , n}. Consider a LLM whose generating class is 
given by {f>i,e2, ••• namely, 

logpi2...n(a^i,a^2, • • • ,a^n) 

where i?(xi, X2, • • • , xjt) is a constant u-term plus the summation of the ue- 
terms each of whose subscript sets, ^'s, is a strict subset of some 9i. 

We denote by P23 - *:|i(a^2, a^3, • • • , a^*:|a^i) the conditional probability that 
Xz = X2,X3 = xs, - ■ ■ ,Xk = Xk, given Xi = xi. Then we have that 

k 

= J2ug,(X,B^) + R{XuX2,---,X^,) -\0gpi{xi). (2.2) 



For notational convenience, we will omit the argument of the subscript set 9 
when confusion is not likely. For 9 such that ^ n {1} = 0( the empty set), we 
let 

Xe) = -To) + "{l}u9(x,, Xe)- (2.3) 

Then, we can re-express equation (2.2) in terms of the usual u-terms and the 
n'^-ternis. At this point, we need to know pi(.r)). Assume Lhat 

ui 7^ 0 in equation (2.1). (2.'1) 
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Let 

c(xO =u + Ui(^.j) - logpi(;ri). ('i-.Vi 

In (2.2), we may rearrange, without loss of generality, ^'5 so that Oi, - ■ ■ ,6r. 
for 1 < r < A;, contain the singleton {1} as a subset, while the other O'h do nol , 
Under this rearrangement, let 6'- = Oi \ {!}, for i - 1,2, • • • ,r. Let - 0. 
while u symbolizes a constant term in a LLM. Then we have, from (2:1) and 
(2.5), that 



(^1) 
».'( i 

i=l i=r+l 

R^^'Ux;, • • • , .T„) - u - + r(.i-, ). (2.(11 



ERIC 



where 

r 

/?.(">)(X2,- • ■,Xn) = n{xuX2,-- • ,a:„) -Y^ue^ x,,)- (2.7) 

.=1 ~ • 

Note that Ey'4''( x ,) ~ where the summation gues over all the possible 
value y's of Xj for each j € 0'-. 

A hierarchical LLM for a joint probability of a set of variables does not net - 
essarily imply a hierarchical LLM for any of its conditional probabilities, l-'ur 
example, consider a LLM whose generating class is given by { { 1 . 2} , { ! . 1 } } . 
Then from equation (2.6) we have 

log/'■23.l|l(•'■•2^.'•.i,•'■.)|•'•l) - 

7 

12 



In this expression, i^^^-^ or u^r^j^ may be equal to 0, while uja'jj may not. 

However, we will confine ourselves, in this paper, to the cases where the 
hierarchy principle holds in the LLMs for the joint probability and for the con- 
ditional probabilities. We will refer to this restriction ihe strong hierarchy 
assumption (SHA). 

We denote a CLLM of a set of variables given .Yi = xn by CM/, / = 
L • • • . and the LLS of CMt by C6', . In expression (2.6), there may be some 
0', such that 0 C <?J C for some j, r + l<j<k. Under the SHA, we don't 
care about such ?/^f"^-terms in expressing CSt, and throw such z4'^"^-tcnns 
into /?(''"^-term. In this sense, we may '.-efer to those terms as the disapixaring 
(/^■^-terms, the rest of them, a.^ the rcmainiiuj 7i("^-terins. Suppose there are s 
disappearing u^'^-terms, and r — s remaining u^'^-tcrms, for 0 < s < r, in the 
r.li.s. of equation (2.6). Without loss of generality, we may then rearrange the 
/^l,';"'^- terms, so that 7Af"\ i = 1,2. • • • ,.s, disappear into /?('^")-t.enn. We now 
have 



:=s+l ' :=r+l 
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wliere 

.1 r 
1=1 ' i=s+] 

(2.8) 

From (2.7) and (2.8), we can see that /2+"^(x2, • • • ,Xn) is a liue.ar combination 
of the u-terms each of whose subscript sets is a strict snbset of some sot in 
{^Ui' ■ ■ ■ ' Or^Sr+i,- ■ ■ , Ok}- Actually is composed of the following 3 lypes 

of u or terms: 

(a) the ui^"'-ternis, for i = 1, 2, • • • , s, with <lS C 0" C 0'-, 

(b) the i4»"'-terms, with % C 0" C 0\, for ?: = .s+ 1, ■ ■ • ,i\ excluding the Xvrm^ 

in (a), 

(c) the u-terms of R{x\i,- ■ ■ ,t„) —ti — except those terms in (a) and 

(b). 

Rewriting logp2-„|i(;C2, • ■ • ,;r„|Ti,) gives 
logp2 -ri|l(.C2, • • • ,x„|;ci() 

r k 

= Yl + Yl •!«,) + (''') 

1 = 5+1 ' ' ' i=r+l 

■fsum of (b) terms + sum of (c) terms + r(.rit). (2.*)) 

This result is true whether ( 1 } is one of in (2.1) or not. i.e.. wlictlici 

9 
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R{xi,x-2, • • • ,av, ) has ui-term in its summation or not. Note tliat c(xi/) is tlie 
constant term in expression (2.9). 

Before concluding this section, we need to note that the u-terms in the 
second summation in expression (2.9) are all non-zero since they remain the 
same as they are in the original model (2.1). On the other hand, some n^ '- 
term in the first summation can be zero, in which case some (b) term may 
affect the CLLS. 

In tliis section, we examined CLLM's and derived a generalized expression 
(2.9). Under the SUA, the CLLS CSi, /= 1, •••,/!, is determined by u and 
u^ '-terms in 

(i) the first two summations in (2.9), if none of the terms is zero; 

(ii) the first, second and fourtli summations in (2.9), if vsome ?/* '-terni in tlie 
first summation is zero. 



3 The CLLSs in a Hybrid Given a Hyper- 
model 

In Section 2, wr liave seen tliat the two (^LLSs can be difTcrent. From the 
CILLSs, we can guess the possible hypermodels. Tlie relationsliip between tlie 



C'LLSs and the hypermodel is examined in this section. The following lemma 
plays an important role in searching for the possible CLLSs. 

Lemma 3.1 Let 6 f] {1} = 0. Then, ue = u^ijue = 0, iff u^g""^ = 0, for all 

Proof: Suppose that "^('is) = 0, V xe, for all / = 1, ■ ■ ■ , /]. Then, it follows 
that, for / = 1, • • • , /i, 

Since J^'iLi "{i}u9(xh, Xg) ~ 0, we have, by equation (3.1), = 0. Hence, the 
"if pari" is proved. 

The proof of the other direction is straightforward. □ 

The following example illustrates what CLLSs are possible by applying 
Lemma 3.1. 

Example 3.2 Suppose that /j = 2, and that tiir. LLS S for a 6-<liw< vsional 
contingency table, is given by 

II = {{1,2,3}, {1,3,4}. {1,3,6}, {3.4,5}, {:?,5,6}}; (3.2) 

;7 is given in (jraphieol form, ns in Figure S.l. 
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Figure 3.1: 
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— C{Xil) -\- U2 -T U3 + 1/4 + Us -|- -t i/23 + U;?,} 

+ lt35 + U^e"^ + U45 + U56 + lt345 + "356, (3.3) 

where some xi^'^-term may be zero. 

Devote the element sets in (3.2) by Oi,---,0^, m that order. Under the 
SHA, our interest is on the u%"^-tcrm in (3.3). If u^^'^ ^ 0, then the CLLS 
CSi is given by Si = {6\,04,Or,] . Otherwise, z/itj^"^ ^ 0, then CSi is given by 
Si = {{2},<?4,^?5}; t/t4""^ = 0, then CS, is given by S; = {04,0^]. 

The LLSs S},S2 and are dfpictcd by the graphs (a), (b) and (c) in 
Figure 3.2, respectively. 

12 
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Figure 3.2: 
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f«; {h) (c) 

The point here is that both ^23"' and t43'^^ cannot be zero. // both were 
equal to zero, then by Lemma 3.1, 

"123 = "23 - 0, 

xi'hich contradicts the LLS S. The same argument applits to U2'. Table S.l 
shows the set of all the possible pairs of CLLSs. 

□ 

We note that the union of the element sets appearing in CSi or CS2 
{2, 3,4,5, 6}, which is equal to Uegs^\{i}- Generalization of this resnlt follows. 
For a LLM M, denote by Q{M) the generating class of M. 

Theorem 3.3 Let Hybrid(CSi,- ■ ■ ,CSiJ be a hybrid from a LLM M . 'Then, 
under condition (2.4) and the SUA, we have 

U ^^={ilu( U 0). 

aeQ(M) ffec5iu..u(7,'.',, 
13 
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Tabl e 3.1: The possible pairs of the CLLSs 
Xi = xii 



Xi = Xi2 i 


Si 


^2 


5-3 


Si 


0' 


0 


0 


S2 


0 


X* 


X 


S3 


0 


X 


X 



*Note.: O ''neans that the corresponding pair of CLLSs are possible; x inipo.'^f^ibh . 



Proof: Let Z = uiliC Si. Fove eg{M)J\{l} G Z by LemmaS.L if {1} C 0: 
if {l}n0 = 0, then the u-terms in the second summation of (2.9) implies 0 ^ Z. 
Thus 

U 0C{l}U{[j6). 
eeG{M) eez 

On the other hand, for 9 ^ Z, we can see, from expression (2.9) and by 

Lemma 3.1, that the true hypermodel involves at least the X-variables with 

subscripts in 0. Thus, by assumption (2.4), we have 



U 0D{l}U{[j9). 
eeG{M) eez 



O.E.D. □ 
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Now we have seen that we don't have to worry about tlir categorical vari- 
ables not involved in a hybrid, when we try to guess the true hypcrmodo' 

14 



corresponding to the hybrid. 

By comparing (2.1) with (2,9) of Section 2, and by applying L(.'iuina 
we can summarize the ])ossible CLLSs from a given LLS as in the Uieoreni 
below, whose proof is briefly outlined. 

Theorem 3.4 Consider a LLS S = {^i, • • • , 61^}. Suppose, for 1 < .s < /■ < k. 
the following three conditions hold: 

(i) 1 e Oj, for 1 < i < r; 1 ^ 0, for r <j < k. 

(ii) For I < j <s, there exists I, r < t < k such that cOi. 

(iii) For s < J < r , 0^ \ {\} Ot for every I, r < I < k. 

Then, under the SUA, xve have the following results: 

(i') For \ < j < s, Oj \ {\} does not show up in ani/ CiJ.S of S . 
(ii') For s < j < r, Oj \ {1} shoxvs up in at h<i.st one CIJ.S of S . 
(iii') For r < I < k, Oi shows up in all the CLLSs of S . 

Proof: Hesults (i ) and (iii") aie ininiodiate from (2.1) and (2.!)). Uesiilt (ii') 
follows from (2.1), (2.9), and Lemma :^.l. Q.K.D. □ 

m 
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In this section, we h^/e seen the possible CLLSs of a LLS. It is also shown 
that only the categorical variables involved in the CLI.Ss are involved in the 
corresponding LLS. In Theorem 3 4, we have seen that some components in a 
LLS are kept or shrunk, or disappear in a corresponding hybrid. A careful look 
at Theorem 3.4 may help us in guessing the unknown LLS (or true hypermodel) 
from the given hybrid. 
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4 Basic Results for Hypermodels correspond- 
ing to a Hybrid 

In the expression of the CLLMs, we must be careful in using j/-teniis 
and u^'^-terms. We will use Vg^ in the CLLMs to denote either uq or Ug \ for 
0 ^ 4' = ^'e when ue 7^ 0 and t/.{i}u9 = 0; I'g ' = when uq 7^ 0 and 
"{i}uo 7^ 0- In this section, we use jz-terms only for the log-linear expression ol 
the hypermodel. In other words, we will use jz-terms for a LLM and '-terms 
for a CLLM. 

Theorem 4.1 Consider the hybrid Hybrid{CS\,- ■ ■ ,CSiJ, and svpposr. thai 
a set 0 is common in all the CSs and that the trne hyprrmodr.l is (jraphiral. 
Then, under condition (2./,) and the SUA, the trne graphical hypermodel c.t 
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Hybrid (CSu ■■■ , CSi^ ) contains one of the follo^inngs: 

eu{i}, Us?}, for III c-^ CO. (i.n 

Proof: Let 

z = U cSi, 

and 

D=[jp. 

By Theorem 3.3, it is necessary and sufficient to use only tiie variables wiiose 
subindices are in the set D. 

For A'l = ,ri;, / = 1, • • • , A, we express the LLM for C'St by 

logPD|i( •Tol-r,;) = f^'"' + c{xv) - "i(.„)- ( l--' 

pCD 

Since all of the CLLSh contain 0, 

t-S"'^ 7^0, for.r, =xn,---,.r,/,. (■^■■^) 
From expression (4.2), we have 

= E + 

Tile candidacy for the true hypermodei is determined l)y wiieliier r|;^ in ( 1.1) is 
zero or not. From tiie condition of the theorem, we know tiiat f H { 1 } = 0. For 
eacii / e {1, • • • , /i}, Vg '"^ ^ 0 imph"s. under tiie SUA, one of tlie foiiowiuRs: 
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(i) "{i}utf(xi,, xg) = 0 and ue^ x^) + 0. 



(ii) u{i}ue(xi,, + 0 and 



^0. 



Hence, there are 2^' possible combinations of cases (i) and (ii) for (u, 



0 



' ' ). However, if case (ii) holds for at least one / in {1, 2, ■ • • , /j}, then 



By Lemma 3.1, anj' set that contains {1} as a subset can not be contained 
in jf/, since such a set doesn't show up in any of the CLLSs. 

On the other hand, if case (i) holds for all / =^ 1,2, then we can 

see that ^ is in // but not {1} U 9. In this case, we must not forget about 
the possibility of the fizsappearm*/ u^'^-terms in some CLLS. Under the premise 
that the true hypermodel is graphical, the consideration of the disappearing 
u^'^-terms gives rise to one of the followings as a subset of //: 



(4.5) and (4.6) are combined into (4.1). Q.E.D. □ 

An alternative situation of the condition of Theorem 4.1 is considered in 
the theorem below. 

Theorem 4.2 For the hybrid Uyhr'idfCSi,- ■ ■ ,CSiJ. suppose that 0 is an 
clemt'-t set of some CS, and not common in all the CSs, and that there is no 



{1} U ^ is contained in the true hypermodel, H, say. 



(4.5) 



{6,ipL>{l}}, for <DCipc9. 



(4.6) 
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set in ull^LLSi \ 9 that contains 9. Then, {1} U 0 is in the tnit hypc-nnodd 

Proof; The condition of the theorem says that 0 is contained in at least one 
CS. Without loss of generahty, let one of such CSs be CS^. It is easy to sec 
that there are three disjoint and exhaustive sets A and B such that 

A = {_/; CSj contains 0], 
i? = {l,2,---,/i)\A. 

For CSu ^'S""^ = 4""^ or ug. That u^""^ = ug means, for ^ = 1. 

"{i}uO(j:i,. ;ce) = 0 and uo{Xg)'/'0. ^ xg. (1.7) 
And that Vg"^^^ = i/g^"^ means, for ? = 1, 

U{i)ue{Tu, To) / 0 and / 0- (•^••'^) 

Recall that all the CLLMs for a LLM are given by the same format as in ('!.'.)) 
and that the variation among the CLLSs is due to whether r*'^ is zero or not. 

For j G 5, either 

"{l>uO(x,j, Xe) = .'•«) = 0, V Xg ('l.;)) 



or 



f'{UuO(.r,j, .r„) / 0 and ?/{i}uO(r,j, .t„) + ,r„) = 0, V xp. ( 1. 

19 



24 



l{ A = {1}, either (4.7) or (4.8) is possible. If {1,2, •••,/!} D AD {1} 
(note that B ^ ^ by the condition of the theorem), one or both of (1.7) and 
(4.8) are possible. It is worth noting that one and only one of (4.9) and (4.10) 
holds for all j G B, since = 0 in (4.9) while ite 7^ 0 in (4.10). Thus, one and 
only one of (4.9) and (4.10) is possible for all j G B. Considering all of these 
situations, we have the following 10 cases we can think of from the condition 
of the theorem: 

(Case-1) A = {1}; (4.7) for CSu and (4.9) for CSj, j G B. 

(Case-2) A = {1}; (4.7) for C5i, and (4.10) for CS,. j G B. 

(Case-3) A= (4.8) for CSu and (4.9) for CS,. j G B. 

(Case-4) A = {1}; (4.8) for CSu and (4.10) for j G B. 

(Case-5) /I D {1}; (4.7) for CSi, i G A, and (4.9) for C5j, j G B. 

(Case-6) A D {1}] (4.7) for C5,-, i G A. and (4.10) for CSj, j G B. 

(Case-7) A D {\}\ (4.8) for C5„ 1 G A, and (4.9) for CSj, j G B. 

(Case-8) A D (4.8) for CS,, i G A, and (4.10) for r\%, j G B. 

(Case-9) A D {1}; (4.7) and (4.8) each at least once for T.S, ? G .4, and 
(4.9) for CSj, J G B. 
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(Case-10) A D {1}; (4.7) and (4.8) each at least, once for C.% i G A. and 
(4.10) for CSj, J e B 

Only 3 of these 10 ".ases are possible. They are cases 4, 8 and 10. Now we 
will investigate why the other« are impossible. 

For Cases 1, 5 and 9: (4.7) and (4.9) are not compatible (see the uq term). 
For Case-2: From (4.10), it follows that 

If (4.7) and (4.10) hold together and \i A = {l}, then 

E'Mi}ufi(.„?«) = -(^i-lK.r«)^0< V -so. (1.11! 
which is a violation of the constraint 

For Cases 3 and 7: (4.8) and (4.9) are inconipatihle (note that by the 
hierarchy assumption (4.8) implies ue 7^ 0). 

For Case-6: Denote the number of the elements \v A by #(.4). Hxprcssiou 
(4.11) now becomes 

5I"{i)uo(r,, ;r„) = -(^1 - #(^))"fl( .r») 7^ ^ 

which is again a violation of the constraint (4.12). (Note that /i - #(/\) > 0 
by the ct)n(lition of the thcorcnn.) 



Wo now turn to the 3 possible cases. 
For Cases 4 and 8: It is possible that (4.8) and (4.10) hold under the 
constraint (4.12). 

For Case 10: For A > 3, it is possible that such {u{i}ue} exist that satisfy 
the conditions of this case under the constraint (4.12). 

In each of the above three possible cases, it is obvious that {1} U 0 is 
contained in the true hypermodel. By the condition of the theorem, at least 
one of tlie three possible ca.ses must occur. Therefore, we have the desired 
result. □. 

The conditions of Theorems 4.1 and 4.2 do not seem to cover all the possible 
situations for each element set in UJLi CSj. The only situation not roiiHidered 
yet is that a set 0 is in some CLLS and not common in all the CLLSs and that 
there is a set in some CLLS which contains 61 as a strict subset. But we don't 
have to worry about the situation. If there is a set ip in some CLLS which 
is not a strict subset of any other set in the union of all the CLLSs, and if 
0 C v^, tluMi under the SHA we can see, in the light of expression (2.9), thai 
the 0 corresponds to a (b)-t.ype term and corresponds to a ?/''"''-term in the 
first suniniation of (2.9). Thus we don't have to worry about the 0 in search 
of the true hypermodel (since the 0 does not affect a LLS under the hierarchy 
principle). Instead we have to concern ourselves with tp and apply Theorem 
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4.2. Tlierefore, we have essentially considered all the necessary silnatioiis of 
the relationship hetween the sets in U/=i ^''-'^t in search of the true hypernujdcl. 

Now the question is if the rules derived in Theorems 4.1 and 4.2 lead us 
towards the true hypermodel from a given hybrid. 



5 Traceability of True Hypermodel 

Consider a hybrid of . In the searching process for the true hyj^er- 

model, we basically have to pay attention to all the element sets of uLi C'S- 
However, as indicated at the end of the preceeding section, we have only to 
concern ourselves, under the SHA, with those element sets each of which sat- 
isfies cither the condition of Theorems 4.1 or the condition of 4.2. This point 
is well noted in the process described below. 

Suppose there are L element sets, V'i>---,V''L- hi U,=i If (\ does not 

satisfy the condition of Theorem 4.1 nor the condition of Theorem 1.2. let 
7', = 0; otherwise, apply either Theorem 4.1 or Theorem 4.2 to e, accordiiif^ly. 
If Theorem 4.1 is applied, let 



7; = { v, U { 1 } , { (,\-, { 1 } U V-} , for 0 C c v, } ; 
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if Theorem 4.2 is applied, let 

T. = {{l}Ut/>,}. 

Then we can obtain a collection C of the sets, where each set. is composed 
of the element sets from the T.'s, one element set from each T,-. For each set 
C in the collection C, if there is an element set c in C which is a subset of 
another element set in C, then we remove the element set c from C. Note that 
we don't need such an element set c under the hierarchy principle. We will 
call this removing process by Subset- Removing process. When the subsets are 
removed from all the sets in C, we denote the new, subset-removed collection 
by C. 

For example, we consider a hybrid of {{2}, {3,4}}i and {{2, 3), {3, 4}}2. 

UC5.= {{2},{2,3},{3,4}}. 
1=1 

Let V'l = {2}, V^2 = {2,3} and ^3 - {3,4}. Then, 1\ = 0, n = {1,2,3}, and 
Ta = {{1,3,4}, {{3,4}, {1}}, {{3,4}, {1,3}}, {{3,4}, {1,4}}}. 

By selecting one element set from each T,-, we have 

C = {{{1,2,3}, {3,4}, {1}}, {{1,2,3}, {3,4}, {1,3}}, 
{{1,2,3), {3,4}, {1,4}}, {{1,2,3}, {1,3,4}}}. 
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After subset-removing, we have 

C = {{{1,2,3}, {3,4}}, {{1,2,3}, {3,4}, {1,4}}. 
{{1,2,3}, {1,3,4}}}. 



We will call the process from hybrid to C the hypermodelling proc(ss. 
Now our question is whether we can find the true hypermodel in C'. llio 
theorem below addresses this issue. 

Theorem 5.1 fTraceability Theorem^ Suppose that the trxif hyptrmodd 
is graphical. Then unckr condition (2.4) and the. SHA, the subset-removed 
collection C which is obtained through the hypermodelling process contain.'' tlx 
true graphical hypermodel. 

Proof: Denote the true hypermodel by //. Without loss of generality, wo may 
take 

// = l^j, • • • , 0n.}i for some subscript sets 0,, i=\,---.k. 

Suppose that we regroup the element sets of // according to whether an 
element set contains "1" and whether, {ov 0 e H , 0\{l} C 0' for some 0' e // 
which does not contain "1". (Note that, by condition (2.4) and tlje liierarcliy 
principle, there exists at least one element set which contains "1".) Tlien // 
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is divided into 3 exhaustive and mutuallj' exclusive subgroups where some 
subgroups may be empty. If {Oi}\ satisfy the conditions (i), (ii) and (iii) of 
Theorem 3.4, then the three subgroups are 

{^i,---,^,}, {es+u---,Or] and {Or+u-'-Jk). (5.1) 

(Note that s can'ljei' equal to 0 or r and that r can be equal to 1 or k.) 

^j, r+l<j<k, appear in all the CLLSs (by (iii') of Theorem 3.4). By 
Lemma 3.1, each 6j, s + I < j < r, appears in at least one of the CLLSs. 
None of {9j]l appears in any CLLS, whose corresponding u^'^'^-terms have 
been dul)bed "disappearing" terms in section 2. Thus Ui' CSi is composed of 
I he sets, Oj, j = s + I, - ■ ■ ,k, and some subsets of some Oj' , j' - .s 4- 1 , • • • , /•. 

We will use T, for the same meaning as previously described. As indicated 
in the above paragraph, 6j\{l}, j = 5 + 1, • • • , A:, are in Ui' (note that 
for j = r + 1, • • • , k, Oj \ {1} = 9j.) For any sot in 

yc5.j\^y(<?A{i})j, (.'3.2) 

there exists,if 3 > 1, j' G {.s + 1, • • • , r} such that the set is a subset of Oj'. As 
shown at the end of the previous section, the sets in (5.2) do not affect the LIS 
of a hypermodel mider the hierarchy principle. Thus the sots may be ignored 
in the hyperniodelling process. In other words, we may concern our.selvcs onlj' 
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with tlie sets Oj \ {1}, j = 5 + 1,---,A:, in U(' (-S, in the hypennocleUiii!; 
process. For z = .s + 1 , • • ■ , k\ let 7',- correspond to 0,- \ {1 } in U|' CSi. 

Assuming H as the true hypermodel, with its element sets regrouped as in 
expression (5.1), we will prove that H € C*. 

For ; € {r + 1, • • • , k) , we apply Theorem 4.1 and get 

r, = {^,U{l},{^„{l}Uc^}, for H^C^^cO,}. (0.3) 

For Oj, s + l < j < 7\ we apply either Theorem 4.1 or Theorem 4.2 according 
a.s whether Oj is common in all the CLLSs or not. If Oj is common in all the 
CLLSs, Tj is given by (5.3). Otherwise, 

T, = {e,}. (.^.1) 

For each 6j, j = r + 1, • • • , A% there is at most one / in { 1, • • • . .s} such that 

{Oy\{l})cO,, 

since the true hypermodel is graphical. Without loss of generality, we may 
suppose that 

{0,\{^})C0,^„ for ? = l,---,,s. (.'-,.5) 
(Note that under the grapliicality assumption s < k — r.) 
Therefore, we have: 
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(i) For Oj. + 1 < < r. Oj G Tj. (Sec (5.3) and (5,4).) 

(ii) For J e {1, • ■ ■ .5}, {0,J,+r} e (See (5.3).) 

(iii) For Oj, r + s < j < k, {9j,{l}} G Tj. (Note that (2.4) is assumed and 
that if r > 1, then {1} will not show up in the expression of a LLS under 
the hierarchy principle.) (See (5.3).) 

The results, (i), (ii), and (iii), mean that // is in C. So far we ha\c 
considered the cases where 1 < s < r < k. But the result also holds for s = 0 
by (i) and (iii). This completes the proof. □ 

We will illustrate the Traceability Theorem for the liypermodol given in 
Example 3.2. Before the illustration, we introiluce a subset -removing operator 
( ) on a set. For example, 

({1,2}, {2,3}, {2.3,4}) = {{1,2}, {2,3,4}}. 

Example 5.2 For th( hypcnnodd S in (.rpnssimi (3.2), flu pofisiblr pair.^ of 
CLLSs are shown in Table S.l. We will vs( flic sanu notation as in ExampU 
Sil) For the hybrid Hybri(I(SuSi); 

C = {({1} U^-,.(?;.{l}UvM,<'4.{l}Uvr^„tr.); 

for 0 C -^1 C C ^4 C 0 C ^r, c Os). (5.6) 
28 
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(2) For the hybrids HybridfS-i, S2) and Hyhrid(S\, Sn,) : 

C - {({l}u<?;,{l}Uv''4,<?4,{l}Uv'.5,<?5); 

/or 0 C (^4 C ^4,0 C (^5 C ^s}- (5.7) 

Wc can sec that the true hypermodel S is included in each of (5.6) mid (5.7). 
□ 
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6 An Illustration 

Consider an educational test which consists of items tiiat can be soivtMl 
by one of two problem-solving strategies. In this section, wc will illustrate an 
application of the main result of this paper supposing that we are given two 
influence diagrams (Oliver and Smith (1990)) of item responses and abilities 
for two problem-solving strategies each. Both influence diagrams (IDs) have 
the same structure as in Figure 6.1. In the figure, "A" stands for ability and 
"X'' stands for item score, where .Y,- is the item score for item /. The arrow 
from Ai to A2 stands for that possession of ability Ai requires possession of 
ability A\ as a prerequisite. The arrows to the node of an item score from a set 
oi the nodes of abilities mean that the abilities are tapjjed by the corresi)()n(liu>!, 
item. For instance, item 1 taps al)ilities denoted hy Ai and A-^: and item 2 
taps the al)ility denoted by A^. The marginal or conditional probal)ilities for 
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Figure 6.1: A fonditional ID 




the IDs are given in Figure 6.2, wliere panel (a) shows tlie conditional ID given 
the problem-solving strategy ST-1, as does panel (b) given ST-2. Suppose the 
l)robal)ilities in Figure 6.2 are obtained from two distinct data sets, for each of 
the two strategies separately. From the conditional IDs in Figure 6.2. assuming 
that P{ST = "ST-l") = O.-l. we generated two data sets, one (of size 768) 
from the model in panel (a) and the other (of size I2IV2) from panel (b). As 
anticipated, the log-linear analysis of the two data sets ends up with the LLS 
as in (6.1) for both data sets. Table 6.1 shows the result of the analysis. 

We may ignore the directions in the graphs (see Lauritzen and Spiegelhalter 
(1988)), and regard each graph as a two-clique graph for the four categorical 
variables, .4i,/l2,A'i, and A'2. For both of the strategies, the CLLSs of the 
four categorical variables are 

{{.•!,, .V,}. {.4,, .4,,A,}}. (6.1) 
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Table 6.1: Log-linear analysis of the data from the panels (a) and (b) of F'igure 
6.2 

(a) For ST-1 (data size = 768) 



Conditional LLS 


d.f. 


Likelihood-ratio 


Pearson 




Prob. 


\^ 


Prob. 


{{.4i,A-2},{/l,,/l2,A',}} 


6 


2.80 


0.83 


2.82 


0.83 


{{A'2},{/li,A2,A'i}}- 


7 


288.11 . 


0.00 




. 0.00 


{{/li,A2},{/ti,.42},{.42,A'i},{/li,Ai}}' 


7 


36.50 


0.00 


50.09 


0.00 


(b) For ST-2 (data size = 1232) 


Conditional LLS 


d.f. 


Likelihood-rat io 


Pearson 




Prob. 




Prob. 


{{A,.X2}.{AuA2.X,}} 


6 


9.00 


0.17 


9.06 


0.17 


{{A'2},{/li,.4.,,Ai}}' 


7 


144.77 


0.00 


157.27 


0.00 


{{/li,A'2},{AiM2},{A2,Ai},{/li,A'i}}- 


7 


43.07 


0.00 


47.89 


0.00 



As appeared in the BMDP onti)nt, using a 



backward-deletion ai)[)roacli. 




As shown in the proof of TraceabiHty Theorem, we may confine ourselv<>s 
to the sets, {Ai,X2} and {Ai,A2,Xi}, in the hypermodelling process. Sincc^ 
the two sets are common to both CLLSs, by applying Theorem 1.1 to each of 
the two sets, we can see that the largest possible hypermodel is given by 

{{ST,AuX2]AST,AuA2,Xr]], (6.2) 

where "ST" is for the categorical variable of the strategy. By the collapsibility 
theorem (Theorem 2.5-1 of Bishop et al. (1975)), we may investigate the 
interaction within the individual sets marginally. The standardized (/-terms 
(see section 4.4.2 of Bishop et al. (1975)) of the three-factor and the four-factor 
effects for the first and the second sets respectively in (6.2) can be estimatcMl 
and used in searching for the true LLS. This may be a useful approach when 
data are not available for all the five variables. 

However, we will take advantage of the simulation set-up where data are 
available for the five variables. The TraceabiHty Theorem says that the true 
LLS is either the model in (C.2) or a submodel of it . 

Actually, the model in (6.2) fits well to the sinuilated data sel from the 
hybrid of the conditional IDs. The observed chi-scjuare was 11. SO with 12 
d.f., whose upper-tail probability is 0.46. Table 6.2 shows a HMDP ()u1i)ut of 
luodcl-fitting by backward-deletion that starts from the LLS in (6.2). 
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Table 6.2: Model-fitting result by BMDP 



LLS 


d.f. 


Likelihood-ratio 


Pear.son 








Prob. 




Prob. 


{{ST,/4i,X2},{ST,A„/l2,A'i}} 


12 


11.80 


0.46 


11.88 


0.46 


{{ST,Ai,A'2},:/li,/l2,Xi},{5r, A2,Xr}, 












{.9r,Ai,J^i}, {ST,Ar,A2}} 


13 


17.72 


0.17 


16.98 


0.20 


{{STAuX2},{AuA2,X:}, 












{ST,A2.X,},{ST,AuA2}} 


14 


20.24 


0.12 


19.27 


0.15 



Table 6.2 suggests two other LLSs, {{ST,A,, A'2},{/li, A^, X, },{.97\ A2, A, } . 
{5r,/li,X,},{5r,.4,,/l2}}and{{ST,A,,A'2}a^i,^,A^},{5T,/l2,Ai},{5T,.4,..4,}}. 
as possible candidate LLSs for the data. But neither of them is graphical. 

We can represent the relationship among these five variables bj' an ID, 

still preserving the relationship among them in the language of LLS. We can 

construct an ID of the five variables in the following order' of the variables: 

Ai,A2,ST.X2,Xi. (See Pearl, Geiger, and Vcrma (1990).) We have an ID of 

the five variables in Figure 6.3, which is obtained as follows^. A] and A2 do 

not look independent (x|,j=2i}2.18), and by the prerecjuisite relation between 

'Any other ordoriiig will do. But this ordering rcllcct.s the liilicrciif relationship among 
the vari;il)les. 

"'See the tahle in t.he appendix for the simulated l're(iueiicy liihle of the live variables. 

34 

O on 

ERIC 



tliein, we put an arrow from node A\ to node A2. The three variables, Ai^A-i. 
and ST, look second-order associated (No submodel of the saturated LLS of 
the throe variables fits well). Choice of a problem-solving method is subject 
to a test-taker's knowledge state and experience (Greeno and Simon (1988)). 
In other words, the test-taker's ability level may influence his or her choice of 
a problem-solving method, which is depicted by two converging arrows from 
the nodes of Ai and A2 to the node of ST. The three variables, A^ST, and 
A''2, make a clique in the LLS of the five variables, and the item score X2 is 
influenced by the state of Ai and the value of ST (i.e., the strategy selected). 
This can be expressed by 

A2LX2\{AuST} (6.3) 

in the ID construction process (see Corollary 4.1 of Pearl, Geiger, and Verma 
(1990)). The relationship in (6.3) is supported by the simulated data. Marginally 
forthe4 variables, ST,Ai,A2Aud A'2, the LLS, { {.ST, /tj, /I2}, {5r, A,, A'2} }, 
has the likelihood-ratio chi-square value 5.73 with 4 d.f. (its upper tail prob.=0.22). 
Finally, the four variables, /li, /I2, ST, and A'l , make a clique, and, by the same 
roa.soning as above, we drew arrows from Ai,/l2, and ST to A'l. This coni- 
Ijletrs an ID for tlie five variables, Ai,A2,ST,Xi, and A'^. The maximal or 
t-onditional probabilities for tlie ID are given in Figure 6.3. Tlu>se probabilities 
arc frccjuenry probabilities based on Table A.l in the app(>ndix. 
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The covariance structures of the five variables from the conibiiied ID (Fig- 
i ure 6.3) and from the hybrid of the conditional IDs (Figure 6.2) were very closc^ 

to each other. The test statistic for the null hypothesis, that the covariance 
structures are the same, asymptotically follows the chi-square distribution wit. li 
15 d.f. (Section 10.6.1 of Anderson (1984)). For simulated data of size 3,000 
(we regenerated them) from each of the two ID models, the combined ID and 
the hybrid of the conditional IDs, the observed value of the test statistic was 
17.246 (p-value=0.304). 

This section has illustrated an application of the main result of l.lir pai)fr 
to the problems of searching for a LLS using its conditional LLSs. In rertlity. 
data for "ST" may be missing. In this situation, however, we can apply \hc 
main result of the paper by assigning a prior distribution on the unobserved 
variable. 

7 Summary Remarks 

A main result in this paper is that we can see partial iiiforniation about 
a true LLS from its CLLSs and use the information in trying to guess the 
true LLS. But we would face uncertainty, as indicated in Tlieorein 4.1, in 
the hypermodclling process, when all the ('LLSs contain at least one conunon 
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subscript, set. Under tlu- assumption that tlie true LIS is grapliicai, we can 
search for the true LLS by focusing on the "largest cliques" that are obtained 
from the CLLSs, as illustrated in the previous section. 

The hypermodels are confined to be graphical in this paper. However, we 
can easily extend to the case where the true hypermodel is hierarchical. In 
such a case, the set of the possible hierarchical hypermodels will become much 
larger. If all the CLLSs of a hybrid are the same and given by {2, 3, • • • , n + 1 } 
for 1 < n < cxD, then the number of the possible graphical hypermodels is 
given by (j„ = 2"; while that of the possible hierarchical hypermodels is given 



1=1 



n 



V ' / 

where /,(;') is the number of j non-empty subgroups of a set of i distinct 
elements, and its values, for 1 < i < i < 0, are given in Table 7.1. The 
numl)ers /y,, and /;„ of up to /) = G are given in Table 7.2. 

Besides the numerical advantage, the graphical log-linear structure would 
l)e a natural means when wc deal with exptn'ts' o])inions which are given 
via causal networks or influence diagrams (see Lauritzen and Spiegelhaulter 
(1989), Shachter (1986) and Smith (1989)). As illustrated in section 6, the liy- 
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Table 7.1: Values of /.(j) for 1 < j < i < 





1 


2 


3 


4 


5 


6 


1 














2 




1 










3 




3 


1 








4 




10 


6 


1 






5 




15 


25 


10 


1 




6 




31 


90 


65 


15 


1 



Table 7.2: Values of </„ and hn for 1 < n < 



n 


9n 


/i„ 


1 


2 


2 


2 


4 


5 


3 


8 


15 


4 


IC) 


55 


5 


32 


218 


6 


64 


922 
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permodelling process may be useful when we want to build a larger graphical 
model from a set of conditional graphical models. Although a simple example 
was used in this paper, the approach can be applied to any sized graphical 
models in principle. 

In this paper, each of the CLLSs in a hybrid involves the same set of 
variables. But it may not be unusual that the CLLSs involve different sets of 
variables. The difference could be due to nouresponse or variable-omission, the 
result of this paper is not of immediate use unless the missingness is ignorable. 
Research in this direction deserves our attention, and the author of this paper 
is currently exploring this issue. 
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Appendix 



A2 




A 

Ai 


A 


ST 










ST-1 


ST-2 


U 


U 


U 


U 


65 


40 








1 


7 


61 






1 


0 


8 


19 








1 


4 


16 




1 
1 


n 
U 


n 
U 


19 


38 








1 


5 


47 






1 
1 


U 


1 


12 








1 
1 


42 


138 


1 


n 
U 


U 


U 


9 


20 








1 
1 


101 


144 






1 
1 


n 
U 


1 


G 








1 


24 


64 




1 


0 


0 


4 


9 








1 


38 


129 






1 


0 


0 


5 








1 


440 


484 
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